GenCo: A Generative Learning Model for Heterogeneous Text Classification Based on Collaborative Partial Classifications

نویسندگان

چکیده

The use of generative learning models in natural language processing (NLP) has significantly contributed to the advancement applications, such as sentimental analysis, topic modeling, text classification, chatbots, and spam filtering. With a large amount generated each day from different sources, web-pages, blogs, emails, social media, articles, one most common tasks NLP is classification corpus. This important many institutions for planning, decision-making, creating archives their projects. Many algorithms exist automate but intriguing them that which also learns these automatically. In this study, we present new model infer learn data using probabilistic logic apply it classification. model, called GenCo, multi-input single-output (MISO) uses collaboration partial classifications generate desired output. It provides heterogeneity measure explain its results enables reduction curse dimensionality Experiments with were carried out on Twitter US Airline dataset, Conference Paper SMS Spam outperforming baseline 98.40%, 89.90%, 99.26% accuracy, respectively.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

mortality forecasting based on lee-carter model

over the past decades a number of approaches have been applied for forecasting mortality. in 1992, a new method for long-run forecast of the level and age pattern of mortality was published by lee and carter. this method was welcomed by many authors so it was extended through a wider class of generalized, parametric and nonlinear model. this model represents one of the most influential recent d...

15 صفحه اول

Designing the organizational cultural model based on the grounded theory emphasizing on collaborative learning culture

Abstract Purpose: The aim of the present study was to provide an optimal model of organizational culture in educational organization of Khorasan Razavi. Methodology: To reach this purpose, a qualitative data research method or grounded theory had been used. Fifteen experts on the subject were interviewed in order to obtain the required data in this study. After conducting interviews, data anal...

متن کامل

Pool-Based Active Learning for Text Classification

This paper shows how a text classifier’s need for labeled training documents can be reduced by employing a large pool of unlabeled documents. We modify the Query-by-Committee (QBC) method of active learning to use the unlabeled pool by explicitly estimating document density when selecting examples for labeling. Then active learning is combined with Expectation-Maximization in order to “fill in”...

متن کامل

A Joint Semantic Vector Representation Model for Text Clustering and Classification

Text clustering and classification are two main tasks of text mining. Feature selection plays the key role in the quality of the clustering and classification results. Although word-based features such as term frequency-inverse document frequency (TF-IDF) vectors have been widely used in different applications, their shortcoming in capturing semantic concepts of text motivated researches to use...

متن کامل

DisTriB: Distributed Trust Management Model Based on Gossip Learning and Bayesian Networks in Collaborative Computing Systems

The interactions among peers in Peer-to-Peer systems as a distributed collaborative system are based on asynchronous and unreliable communications. Trust is an essential and facilitating component in these interactions specially in such uncertain environments. Various attacks are possible due to large-scale nature and openness of these systems that affects the trust. Peers has not enough inform...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Applied sciences

سال: 2023

ISSN: ['2076-3417']

DOI: https://doi.org/10.3390/app13148211